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Abstract 

We  study  economic  incentives  for  influencing  selfish  behavior  in  networks.  We 
consider  a  model  of  selfish  routing  in  which  the  latency  experienced  by  network  traffic 
on  an  edge  of  the  network  is  a  function  of  the  edge  congestion,  and  network  users  are 
assumed  to  selfishly  route  traffic  on  minimum- latency  paths.  The  quality  of  a  routing 
of  traffic  is  historically  measured  by  the  sum  of  all  travel  times,  also  called  the  total 
latency. 

It  is  well  known  that  the  outcome  of  selfish  routing  (a  flow  at  Nash  equilibrium )  does 
not  minimize  the  total  latency,  and  that  marginal  cost  pricing — charging  each  network 
user  for  the  congestion  effects  caused  by  its  presence — eliminates  the  inefficiency  of 
selfish  routing.  However,  the  principle  of  marginal  cost  pricing  assumes  that  taxes 
cause  no  disutility  to  network  users;  this  is  appropriate  only  when  collected  taxes  can 
be  feasibly  returned  (directly  or  indirectly)  to  the  users.  If  this  assumption  does  not 
hold  and  we  wish  to  minimize  the  total  user  disutility  (latency  plus  taxes  paid) — the 
total  cost — how  should  we  price  the  network  edges?  Intuition  may  suggest  that  taxes 
can  never  improve  the  cost  of  a  Nash  equilibrium,  but  the  famous  Braess’s  Paradox 
shows  this  intuition  to  be  incorrect. 

We  consider  strategies  for  pricing  network  edges  to  reduce  the  cost  of  a  Nash  equi¬ 
librium.  Since  levying  a  sufficiently  large  tax  on  an  edge  effectively  removes  it  from 
the  network,  our  study  generalizes  previous  work  on  designing  networks  for  selfish 
users  [36].  In  this  paper,  we  prove  the  following  results. 

•  In  a  large  class  of  networks — including  all  networks  with  linear  latency  functions — 
marginal  cost  taxes  do  not  improve  the  cost  of  a  Nash  equilibrium. 

*  A  preliminary  version  of  this  paper  appeared  in  the  Proceedings  of  the  4th  ACM  Conference  on  Electronic 
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•  The  largest-possible  benefit  from  taxes  does  not  exceed  that  from  edge  removals. 
In  every  network  with  linear  latency  functions,  the  benefit  of  taxes  cannot  exceed 
that  of  removing  edges.  There  are  networks  with  nonlinear  latency  functions, 
however,  in  which  taxes  are  radically  more  powerful  than  edge  removals. 

•  For  every  e  >  0,  there  is  no  (|— e)-approximation  algorithm  for  computing  optimal 
taxes,  even  in  networks  with  linear  latency  functions  (assuming  P  NP). 


1  Introduction 

1.1  Selfish  Routing  and  Marginal  Cost  Pricing 

We  study  economic  incentives  for  influencing  selfish  behavior  in  networks.  We  focus  on  a 
simple  model  of  selfish  routing,  defined  by  Wardrop  [44]  and  first  studied  in  the  theoretical 
computer  science  literature  by  Roughgarden  and  Tardos  [39].  In  this  model,  we  are  given 
a  directed  network  in  which  each  edge  possesses  a  latency  function  describing  the  common 
delay  experienced  by  all  traffic  on  the  edge  as  a  function  of  the  edge  congestion.  There  is  a 
fixed  amount  of  traffic  that  wishes  to  travel  from  a  source  vertex  s  to  a  sink  vertex  t;  as  in 
most  earlier  works,  we  assume  that  the  traffic  comprises  a  very  large  population  of  users,  so 
that  the  actions  of  a  single  individual  have  negligible  impact  on  the  network  congestion.  The 
quality  of  an  assignment  of  traffic  to  s-t  paths  is  historically  measured  by  the  resulting  sum 
of  all  travel  times — the  total  latency.  We  assume  that  each  network  user,  when  left  to  its 
own  devices,  acts  selfishly  and  routes  itself  on  a  minimum-latency  path,  given  the  network 
congestion  caused  by  the  other  users.  In  general  such  a  “selfish”  assignment  of  traffic  to 
paths  (a  flow  at  Nash  equilibrium )  does  not  minimize  the  total  latency;  put  differently,  the 
outcome  of  selfish  behavior  can  be  improved  upon  with  coordination. 

The  inefficiency  of  selfish  routing  motivates  the  introduction  of  economic  incentives  to 
ensure  that  selfish  behavior  results  in  a  socially  desirable  routing  of  traffic.  An  ancient  idea — 
discussed  informally  as  early  as  1920  [35] — is  to  use  marginal  cost  pricing.  The  principle  of 
marginal  cost  pricing  asserts  that  on  each  edge,  each  network  user  on  the  edge  should  pay  a 
tax  equal  to  the  additional  delay  its  presence  causes  for  the  other  users  on  the  edge.  Assuming 
that  all  network  users  choose  routes  to  minimize  the  sum  of  the  latency  experienced  and  the 
taxes  paid,  this  principle  ensures  that  the  resulting  flow  at  Nash  equilibrium  achieves  the 
minimum-possible  total  latency  [5].  Briefly,  the  inefficiency  of  selfish  routing  can  always  be 
eradicated  by  pricing  network  edges  appropriately. 

The  following  observation  motivates  our  work:  the  principle  of  marginal  cost  pricing  is 
single-minded  in  its  pursuit  of  a  minimum- latency  flow,  and  ignores  the  disutility  to  network 
users  due  to  (possibly  very  large)  taxes.  This  assumption  is  only  appropriate  when  collected 
taxes  can  be  feasibly  returned  (directly  or  indirectly)  to  the  network  users,  for  example  by 
refunding  taxes  equally  to  all  users  (a  “lump-sum  refund”).  In  this  paper,  we  are  interested  in 
settings  where  this  assumption  is  not  reasonable.  For  example,  refunding  the  collected  taxes 
to  network  users  could  be  logistically  or  economically  infeasible,  or  taxes  could  represent 
quantities  of  a  non-monetary,  non-refundable  good  such  as  time  delays. 


2 


/  (x)  =  X 


/(x)  =  l 


Figure  1:  Braess’s  Paradox 


If  we  wish  to  minimize  the  total  user  disutility — latency  plus  taxes  paid,  a  quantity  we 
call  the  cost — rather  than  merely  the  total  latency,  how  should  we  price  the  network  edges? 

1.2  Braess’s  Paradox  and  the  Power  of  Taxes 

Intuition  may  suggest  that  taxes,  which  can  only  increase  the  disutility  incurred  by  users 
along  an  s-t  path,  can  never  improve  the  cost  of  a  flow  at  Nash  equilibrium.  We  next 
describe  Braess’s  Paradox  [7],  in  a  form  first  described  by  Schulman  [40],  which  shows  that 
this  intuition  is  incorrect. 

Each  edge  of  the  network  in  Figure  1  is  labeled  with  its  latency  function,  giving  the  delay 
incurred  by  traffic  on  the  link  as  a  function  of  the  amount  of  traffic  that  uses  the  link.  (We 
assume  that  there  is  one  unit  of  traffic  overall.)  In  the  (unique)  flow  at  Nash  equilibrium, 
all  traffic  uses  route  the  s  — >  v  — >  w  — >  t  and  experiences  two  units  of  latency.  On  the 
other  hand,  if  at  least  half  a  unit  of  tax  is  levied  on  the  edge  ( v,w ),  then  in  the  flow  at 
Nash  equilibrium  half  of  the  traffic  uses  each  of  the  routes  s  — >  v  — >  t  and  s  — »  w  — »  t.  In 
particular,  the  path  s  — >  v  — >  w  — >  t  has  latency  1  and  cost  at  least  3/2  with  respect  to 
this  flow,  and  hence  does  not  offer  an  attractive  alternative  to  traffic.  In  this  new  flow  at 
Nash  equilibrium,  everyone  experiences  latency  3/2  and  no  taxes  are  paid.  This  outcome 
has  cost  3/2  and  is  clearly  superior  to  the  original  flow  at  Nash  equilibrium  in  the  absence 
of  taxes. 

For  contrast,  we  next  discuss  the  edge  taxes  that  are  dictated  by  the  principle  of  marginal 
cost  pricing  for  the  network  of  Figure  1.  As  we  will  see  in  Subsection  3.1,  these  taxes  are  1/2 
on  the  edges  (s,  v)  and  (w,  t )  and  0  on  the  other  three  edges.  With  these  taxes,  we  obtain 
the  same  flow  at  Nash  equilibrium  as  in  our  previous  solution  with  taxes,  and  all  traffic 
experiences  latency  3/2.  However,  all  traffic  must  also  pay  1/2  unit  of  tax.  This  solution 
thus  has  a  cost  of  2;  evidently,  the  previous  solution  should  be  preferred  over  marginal  cost 
taxes  in  this  example. 

The  potential  power  of  taxes  to  improve  the  cost  of  a  flow  at  Nash  equilibrium,  together 
with  the  inadequacy  of  marginal  cost  pricing  for  this  goal,  motivate  the  questions  that  we 
study  in  this  paper. 
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Section 

Problem  Studied 

Linear 

Arbitrary 

3 

Do  marginal  cost  taxes  ever  help? 

no 

yes 

4 

Maximum  benefit  of  taxes 

4/3 

n/2 

5 

Are  taxes  more  powerful  than  edge  removals? 

no 

yes 

6 

Approximability  of  optimal  taxes 

4/3 

[4/3,  n/2] 

Table  1:  Contributions  of  this  paper.  All  results  depend  on  the  set  of  allowable  edge  latency 
functions.  For  simplicity,  we  list  only  results  for  networks  with  linear  or  arbitrary  latency 
functions.  The  benefit  of  taxes  is  measured  by  the  ratio  in  Nash  flow  cost  before  and  after 
taxes  are  levied. 


(1)  Are  marginal  cost  taxes  ever  a  good  idea  for  minimizing  the  cost  of  a  flow  at  Nash 
equilibrium? 

(2)  A  sufficiently  large  edge  tax  effectively  removes  the  edge  from  the  network,  and  the 
power  of  removing  edges  to  improve  a  flow  at  Nash  equilibrium  is  well  understood  [36]. 
Taxes  are  thus  at  least  as  powerful  as  edge  removals;  when  are  they  strictly  more 
powerful? 

(3)  Can  we  compute  or  approximate  optimal  taxes  efficiently? 

1.3  Our  Results 

Our  contributions  on  these  three  questions  are  proofs  of  the  following  results. 

•  In  every  network  with  linear  latency  functions,  marginal  cost  taxes  do  not  improve  the 
cost  of  a  flow  at  Nash  equilibrium. 

•  The  maximum-possible  benefit  of  taxes  is  no  more  than  that  of  edge  removals. 

•  For  every  network  with  linear  latency  functions — not  merely  worst-case  examples — 
taxes  cannot  decrease  the  cost  of  a  flow  at  Nash  equilibrium  beyond  what  can  be 
achieved  by  removing  edges.  By  contrast,  there  are  networks  with  nonlinear  latency 
functions  in  which  taxes  can  radically  improve  over  the  best  subgraph  solution. 

•  For  every  e  >  0,  there  is  no  (|  —  e)-approximation  algorithm  for  computing  optimal 
taxes,  even  in  networks  with  linear  latency  functions  (assuming  P  ^  NP).  For  net¬ 
works  with  linear  latency  functions,  this  hardness  result  is  optimal. 

Table  1  summarizes  our  results.  Determining  the  maximum-possible  benefit  of  taxes  and 
the  (in)approximability  of  the  problem  of  computing  optimal  taxes  require  only  reasonably 
straightforward  extensions  of  existing  work  on  network  design  [36];  all  other  results  of  this 
paper  require  new  constructions  and  proof  approaches. 
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1.4  Related  Work 


As  we  have  noted,  marginal  cost  pricing  in  selfish  routing  networks  was  first  proposed  by 
Pigou  [35].  The  model  of  selfish  routing  studied  in  this  paper  was  first  mathematically 
formalized  in  the  1950s  by  Wardrop  [44]  and  Beckmann,  McGuire,  and  Winsten  [5],  and  has 
been  extensively  studied  ever  since.  Further  discussion  and  many  more  references  on  both 
selfish  routing  and  related  network  models  can  be  found  in  [38]. 

We  make  no  attempt  to  survey  the  vast  literature  on  the  optimal  pricing  of  shared 
resources,  and  mention  only  a  few  references  that  can  serve  as  a  starting  point  for  further 
reading.  Surveys  and  recent  work  on  pricing  selfish  routing  networks  by  the  transportation 
science  community  include  [4,  6,  13,  14,  16,  22,  23,  41].  There  have  also  been  several  recent 
theoretical  computer  science  papers  on  the  topic  [9,  17,  18,  24],  While  researchers  have 
long  realized  that  marginal  cost  taxes  may  cause  users  to  pay  more  than  is  necessary  (see 
e.g.  [6,  22]),  none  of  the  above  papers  incorporated  the  taxes  paid  into  the  definition  of  social 
welfare  as  in  the  present  work.  Beyond  selfish  routing  networks,  there  has  been  an  enormous 
amount  of  research  on  pricing  in  various  network  models;  see  [1,  8,  20,  26,  30,  31]  and  the 
references  therein  for  many  examples. 

We  note  that  much  of  the  aforementioned  work  can  be  interpreted  in  the  context  of 
general  microeconomic  theory;  see  the  survey  by  Mirrlees  [33],  for  example,  for  discussion 
and  references  on  this  point.  We  are  not,  however,  aware  of  any  work  in  the  economics 
literature  that  directly  applies  to  the  questions  posed  in  this  paper. 

Some  of  the  issues  that  we  study  are  similar  in  spirit  to  the  work  on  “frugal  mechanisms” 
pioneered  by  Archer  and  Tardos  [2,  3]  and  studied  further  in  [15,  25,  32,  42],  These  papers 
seek  mechanisms  (such  as  auctions)  that  solve  an  optimization  problem  in  an  incentive- 
compatible  way,  but  also  make  use  of  only  moderate  incentives. 

The  paper  closest  to  the  present  work  is  that  of  Roughgarden  [36]  on  designing  networks 
for  selfish  users.  The  central  questions  of  [36]  concern  the  maximum-possible  benefit  from 
and  the  algorithmic  complexity  of  removing  edges  from  a  network  with  selfish  routing.  These 
questions  can  be  viewed  as  special  cases  of  some  of  the  problems  considered  here,  with  edge 
taxes  restricted  to  be  either  0  or  +oo.  This  paper  extends  some  of  the  results  of  [36]  to  the 
setting  of  more  general  taxes,  and  in  addition  tackles  problems  that  have  not  been  previously 
considered. 

1.5  Organization 

In  Section  2  we  formally  define  our  traffic  routing  model  and  review  useful  results  from  past 
works.  In  Section  3  we  ask  if  marginal  cost  pricing  can  improve  the  cost  of  selfish  routing, 
and  we  resolve  this  question  in  the  negative  for  networks  with  linear  latency  functions.  In 
Sections  4  and  5  we  ask  if  the  power  of  taxes  exceeds  that  of  edge  removals.  In  Section  4 
we  adapt  previous  work  on  network  design  [36]  to  show  that  the  largest-possible  decrease 
in  cost  due  to  taxes  cannot  exceed  that  due  to  edge  removals.  In  Section  5  we  show  that 
taxes  never  improve  over  the  best  solution  achievable  by  removing  edges  in  networks  with 
linear  latency  functions,  but  can  be  radically  more  powerful  than  edge  removals  in  networks 
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with  arbitrary  latency  functions.  Section  6  studies  the  algorithmic  problem  of  computing 
taxes  to  minimize  the  cost  of  a  flow  at  Nash  equilibrium.  Section  7  concludes  with  several 
suggestions  for  future  research. 

2  Preliminaries 

2.1  Networks  and  Flows 

We  follow  the  notation  and  conventions  of  Roughgarden  and  Tardos  [39].  We  study  a  single¬ 
commodity  flow  network,  described  by  a  directed  graph  G  =  (V,  E )  with  a  source  vertex  s 
and  a  sink  vertex  t.  We  allow  parallel  edges  but  have  no  use  for  self-loops.  We  denote  the  set 
of  simple  s-t  paths  by  V,  and  we  assume  that  this  set  is  nonempty.  A  flow  f  is  a  nonnegative 
vector,  indexed  by  V.  For  a  fixed  flow  /  we  define  fe  =  'fflpe'P-etP  fp  as  the  amount  of  traffic 
using  edge  e  en  route  from  s  to  t.  With  respect  to  a  finite  and  positive  traffic  rate  r,  a  flow 
/  is  said  to  be  feasible  if  fp  =  r. 

The  network  G  suffers  from  congestion  effects;  to  model  this,  we  assume  that  each  edge 
e  possesses  a  nonnegative,  continuous,  nondecreasing  latency  function  le  that  describes  the 
delay  incurred  by  traffic  on  e  as  a  function  of  the  edge  congestion  fe.  The  latency  of  a  path 
P  in  G  with  respect  to  a  flow  /  is  then  given  by  (p(f)  =  J2eeP  ^e(fe)-  The  quality  of  a  flow 
is  historically  measured  by  its  total  latency  L(f),  defined  by 

£(/)  =  Y1  W)/p  =  E  We)/e. 

P&V  e€ E 

where  the  equality  follows  by  writing  £p(f)  as  a  sum  over  edges  and  reversing  the  order  of 
summation.  We  will  call  a  flow  minimizing  Lf)  optimal  or  minimum-latency.  An  optimal 
flow  always  exists,  as  the  space  of  all  flows  is  compact  and  L(-)  is  a  continuous  function. 

Finally,  we  allow  a  set  of  nonnegative  taxes  {re}e£E  to  be  placed  on  the  edges  of  a  network 
G.  We  write  rP  =  re  for  the  total  taxes  on  a  path  P.  The  cost  C(f,  r)  of  a  flow  /  in  a 

network  with  taxes  r  is  the  total  disutility  caused  to  network  users,  accounting  for  disutility 
due  to  both  latency  and  taxes: 

C(f,T)  =  J>P(/)  +  Tp\fp  =  ^[teife)  +  Te\fe. 

P&V  e€ E 

The  functions  L(-)  and  Cf,r )  coincide  if  r  =  0.1  We  call  a  triple  ( G,r,£ )  an  instance ,  and 
use  the  notation  (G,  r,  £  +  r)  to  denote  an  instance  in  which  taxes  r  have  been  levied  on  the 
edges  of  G. 

2.2  Flows  at  Nash  Equilibrium 

We  assume  that  noncooperative  behavior  results  in  a  Nash  equilibrium — a  “stable  point”  in 
which  no  traffic  has  an  incentive  to  unilaterally  alter  its  strategy  (i.e. ,  its  route  from  s  to  t). 

1In  several  previous  papers  on  selfish  routing  (without  taxes),  such  as  [37,  39],  the  terms  total  latency 
and  cost  were  used  synonymously. 
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We  also  assume  that  all  agents  seek  to  minimize  the  sum  of  the  latency  experienced  and  the 
tax  paid.2  We  therefore  expect  that,  in  a  flow  at  Nash  equilibrium,  all  traffic  is  routed  on 
paths  with  minimum-possible  latency  plus  tax.  Formally,  we  have  the  following  definition. 

Definition  2.1  A  flow  /  feasible  for  (G,  r,  £  +  r)  is  at  Nash  equilibrium  or  is  a  Nash  flow  if 
for  all  Pi  .If  £  V  with  fpl  >0, 


Ppiif)  +  rPi  <  4>2(/)  +  rP2- 

We  next  discuss  several  useful  properties  of  flows  at  Nash  equilibrium  in  single-commodity 
networks.  None  of  these  results  are  new;  for  proofs,  see  the  original  research  papers  or  the 
book  by  Roughgarden  [38]. 

First  is  an  alternative  definition  of  a  Nash  flow,  which  is  a  simple  consequence  of  the  fact 
that  such  a  flow  routes  traffic  only  on  shortest  paths  with  respect  to  latencies  plus  taxes. 

Proposition  2.2  ([36])  Let  f  be  a  flow  feasible  for  (G,  r,  £  +  r).  For  a  vertex  v  in  G,  let 
d(v)  denote  the  length,  with  respect  to  edge  lengths  4(/e)  +  re,  of  a  shortest  s-v  path  in  G. 
Then 

d{w)  -  d(v)  <  4(/e)  +  Te 

for  all  edges  e  =  (v,w),  and  f  is  at  Nash  equilibrium  if  and  only  if  equality  holds  whenever 

fe>  0. 

An  immediate  consequence  of  Proposition  2.2  is  that  the  property  of  being  at  Nash  equilib¬ 
rium  is  a  property  only  of  the  flow  vector  on  edges  {/e}  induced  by  a  flow  /,  rather  than  on 
the  particular  path  decomposition. 

Corollary  2.3  Suppose  /,  /  are  flows  for  ( G,r,£  +  r)  with  fe  =  fe  for  all  edges  e.  Then  f 
is  at  Nash  equilibrium  if  and  only  if  f  is  at  Nash  equilibrium. 

We  next  discuss  the  existence  and  uniqueness  of  Nash  flows. 

Proposition  2.4  ([5,  12])  Every  instance  (G,  r,  £  +  t)  admits  a  directed  acyclic  flow  at 
Nash  equilibrium. 

Proposition  2.5  ([5,  12])  If  f,  f  are  flows  at  Nash  equilibrium  for  (G,r,£  +  t),  then: 

(a)  4(/e)  =  4 (fe)  for  all  edges  e; 

(b)  C(f,r)  =  C(f,  t). 

The  next  proposition  states  that  all  paths  used  by  a  Nash  flow  have  the  same  combined 
latency  and  tax,  and  that  the  cost  of  a  Nash  flow  is  therefore  expressible  in  a  very  simple 
form.  It  follows  easily  from  Definition  2.1  and  the  definition  of  the  cost  a  flow. 

2This  assumption,  while  classical,  is  obviously  quite  strong.  In  general,  we  expect  different  agents  to  have 
different  objective  functions  and  to  trade  off  time  and  money  in  different  ways.  This  objection  raises  several 
interesting  issues  that  were  studied  in  a  recent  sequence  of  papers  [9,  17,  18,  24], 
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Figure  2:  The  second  and  third  Braess  graphs.  Edges  are  labeled  with  their  type. 


Proposition  2.6  ([44])  Let  f  be  at  Nash  equilibrium  for  the  instance  (■ G,r,£  +  r).  Then, 
there  is  a  constant  c  >  0  such  that: 

(a)  £p(f)  +  Tp  =  c  whenever  fp  >  0; 

(b)  C(f,  t)  =  r  ■  c. 

Propositions  2.5(b)  and  2.6(b)  imply  that  the  constant  c  in  Proposition  2.6  is  independent 
of  the  chosen  Nash  flow  /  for  (G,  r,  £  +  t).  We  can  therefore  adopt  the  notation  c(G,  r,  £  +  t) 
for  the  value  of  this  constant  for  the  instance  (G,  r,  £  +  r). 

Our  final  proposition  states  that,  for  fixed  G,  £,  and  r,  the  value  of  c(G,r,£  +  r)  is 
nondecreasing  in  r.  This  fact  was  first  proved  by  Hall  [21]  and  we  will  use  it  in  Section  3. 
For  a  combinatorial  proof  of  the  proposition,  see  Lin,  Roughgarden,  and  Tardos  [28]. 

Proposition  2.7  ([21])  The  value  c(G,r,£  +  r)  is  nondecreasing  in  r. 

2.3  The  Braess  Graphs 

In  this  subsection  we  review  the  “Braess  graphs”.  These  networks  were  first  defined  by 
Roughgarden  [36]  to  show  that  removing  edges  from  a  network  can  decrease  the  total  latency 
of  a  Nash  flow  by  an  unbounded  amount.  As  levying  taxes  can  be  viewed  as  a  natural 
generalization  of  removing  edges,  these  networks  will  also  play  an  important  role  in  this 
paper. 

The  kth  Braess  graph  Bk  =  ( Vk ,  Ek )  is  defined  as  follows:  start  with  a  set  Vk  = 
(s,  i>i, . . . ,  Vk,  w\, . . . ,  Wk,  t}  of  2k  +  2  vertices  and  define  Ek  by  {(s,  vf),  (ig,  wf),  ( wy ,  t)  :  1  < 
i  <  k}  U  {(vi,  iUj_i)  :  2  <  i  <  k}  U  {(ui,  t),  (s,  Wk)}  (see  Figure  2).  The  graph  B 1  is  the  same 
as  Braess’s  original  example  (Figure  1). 
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(a)  Nash  flow  for  [B3, 3,£3) 


(b)  Nash  flow  in  the  optimal  subgraph 


Figure  3:  Proof  of  Proposition  2.8,  when  k  —  3.  Solid  edges  carry  traffic  in  the  flow  at 
Nash  equilibrium,  dashed  edges  do  not.  Edge  latencies  are  with  respect  to  flows  at  Nash 
equilibrium. 


To  describe  latency  functions  for  and  flows  in  Bk  in  a  convenient  way,  we  introduce 
additional  terminology.  We  call  edges  of  the  form  {v^wf)  type  A  edges ;  edges  of  the  form 
1),  (i’i ,t),  or  (s,Wk)  type  B  edges ;  and  edges  of  the  form  (s,ty)  or  ( Wi,t )  type  C  edges 
(see  Figure  2).  For  i  =  1  let  Pi  denote  the  path  s  — >  Vi  — >  Wi  — >  t.  Finally,  for 

i  —  2, . . . ,  k,  let  Qi  denote  the  path  s  — >  Vi  — >  Wi- 1  — >  f;  define  Q i  to  be  the  path  s  — »  v\  — >  £ 

and  Qfc+i  the  path  s  Wk  t. 

Braess  graphs  show  that  the  latency  of  a  Nash  flow  in  an  n- vertex  network  can  be 
improved  by  an  \n/ 2j  factor  by  removing  \nj 2J  —  1  edges  from  the  network. 

Proposition  2.8  ([36])  For  every  n  >  2,  removing  edges  from  a  network  with  n  vertices 
and  arbitrary  latency  functions  can  decrease  the  total  latency  of  a  Nash  flow  by  a  factor 
of  [n/2j . 

Proof:  Fix  n,  which  we  can  assume  is  at  least  4,  and  let  k  =  \n/ 2J  —  1.  Define  latency 
functions  ik  on  the  edges  of  Bk  as  follows:  type  A  edges  have  constant  latency  0;  type  B 
edges  have  constant  latency  1;  and  for  each  i  e  {1,2,...,  k},  the  type  C  edges  (tUj,  t)  and 
(s,  Vk-i+i)  receive  a  continuous,  nondecreasing  latency  function  £k  satisfying  £k(k/(k+ 1))  =  0 
and  £g(l)  =  i. 

Routing  one  unit  of  flow  on  each  of  the  paths  Pl} ,  P^  gives  a  flow  at  Nash  equilibrium 
for  the  instance  ( Bk,kflk )  in  which  all  traffic  experiences  latency  k  +  1  (see  Figure  3(a)). 
Removing  the  type  A  edges  from  Bk  produces  a  subgraph  Ft,  and  routing  k/(k  +  1)  units  of 
flow  on  each  of  the  paths  Q i, ,  Qk+i  yields  a  flow  at  Nash  equilibrium  for  ( H ,  k,  £k )  with 
all  traffic  incurring  latency  1  (see  Figure  3(b)).  ■ 

We  will  also  study  networks  with  edge  latency  functions  that  are  degree-bounded  poly- 
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normals  with  nonnegative  coefficients.  The  following  analogue  of  Proposition  2.8  holds  for 
such  networks. 

Corollary  2.9  ([36])  Removing  edges  from  a  network  with  latency  functions  that  are  poly¬ 
nomials  with  degree  at  most  p  and  nonnegative  coefficients  can  decrease  the  total  latency  of 
a  Nash  flow  by  an  f l(p/  logp)  factor  as  p  — >  oo. 

Corollary  2.9  can  be  proved  with  a  modification  of  the  construction  used  in  Proposi¬ 
tion  2.8,  using  the  kth  Braess  graph  with  k  ~  p/  logp.  The  traffic  rate  and  latency  functions 
are  identical  to  those  in  the  proof  of  Proposition  2.8,  except  that  a  type  C  edge  of  the  form 
(wi,t)  or  (s,Vfc_;+i)  receives  the  latency  function  ixp. 

Finally,  a  lower  bound  on  the  benefit  achievable  with  edge  removals  in  networks  with 
linear  latency  functions  follows  from  the  network  in  Figure  1. 

Corollary  2.10  ([7,  40])  Removing  edges  from  a  network  with  linear  latency  functions  can 
decrease  the  total  latency  of  a  Nash  flow  by  a  factor  of  4/3. 

3  When  Do  Marginal  Cost  Taxes  Help? 

In  this  section,  we  study  the  cost  of  applying  the  principle  of  marginal  cost  pricing.  In 
Subsection  3.1  we  formalize  marginal  cost  taxes  and  the  classical  guarantee  that  they  induce 
the  minimum- latency  flow  as  a  flow  at  Nash  equilibrium.  In  Subsection  3.2,  however,  we 
show  that  marginal  cost  taxes  cannot  decrease  the  cost  of  a  Nash  flow  in  a  network  with 
linear  latency  functions. 

3.1  Marginal  Cost  Taxes  Minimize  Latency 

Recall  that  the  principle  of  marginal  cost  pricing  posits  that  each  user  should  pay  a  tax  equal 
to  the  additional  delay  other  users  experience  because  of  its  presence.  Mathematically,  this 
principle  asserts  that  in  a  flow  /  feasible  for  the  instance  (G,  r,  £),  the  tax  re  assigned  to  edge 
e  should  be  re  —  fe  ■  £'e(fe),  where  £'e  denotes  the  derivative  of  £e.  (Assume  for  simplicity 
that  the  latency  functions  are  differentiable.)  The  term  £'e(fe)  corresponds  to  the  marginal 
increase  in  latency  caused  by  one  user  on  the  edge,  and  the  term  fe  is  the  amount  of  traffic 
that  suffers  from  this  increase.  Marginal  cost  taxes  come  with  the  following  guarantee,  which 
is  classical  (see  [5,  12,  38]  for  proofs). 

Proposition  3.1  Let  (G,r,£)  be  an  instance  with  differentiable  latency  functions,  admitting 
a  minimum-latency  flow  /*.  Let  re  =  /*  ■  £'e(ff)  denote  the  marginal  cost  tax  for  edge  e  with 
respect  to  f*.  Then  f*  is  at  Nash  equilibrium  for  ( G ,  r,  £  +  r). 

In  words,  marginal  cost  taxes  induce  the  minimum- latency  flow  as  a  flow  at  Nash  equilibrium. 
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3.2  Marginal  Cost  Taxes  Increase  Cost 

Proposition  3.1  shows  how  to  minimize  the  total  latency  of  a  Nash  flow  with  edge  taxes.  But 
how  effective  are  marginal  cost  taxes  when  we  also  account  for  the  disutility  to  traffic  due  to 
taxes?  We  have  already  seen,  in  Subsection  1.2,  that  marginal  cost  taxes  need  not  minimize 
the  cost  of  a  flow.  Our  next  theorem  identifies  a  reasonably  large  class  of  networks — networks 
in  which  all  latency  functions  are  linear,  with  the  form  £{x)  =  ax  +  b — in  which  marginal 
cost  taxes  are  guaranteed  to  be  unnecessary,  if  not  detrimental.  This  result  illustrates  the 
dangers  of  marginal  cost  pricing  when  minimizing  latency  is  not  the  sole  goal. 

Theorem  3.2  Let  f*  and  f  be  minimum-latency  and  Nash  flows,  respectively,  for  an  in¬ 
stance  ( G ,  r,  £)  with  linear  latency  functions.  Let  r  denote  the  marginal  cost  taxes  with  respect 
to  f*.  Then, 

C(f,  0)  <  C(f*,r). 

Proof:  Let  (G,r,  l)  be  an  instance  with  linear  latency  functions,  with  £e(x)  =  aex  +  be  for 
each  edge  e  (with  ae,  be>  0).  Let  /*  and  /  be  minimum- latency  and  Nash  flows,  respectively, 
for  ( G,r,£ ).  The  principle  of  marginal  cost  pricing  dictates  that  re  —  f*  ■  £'e(.ff)  =  aeff  for 
each  edge  e. 

Define  the  modified  latency  function  £*e  by  £*e(x)  =  2 aex  +  be.  The  functions  l  +  r  and  £* 
are  not  identically  equal,  but  the  identity 

We*)  =  2«e/;  +  be  =  te(ff)  +  Te 

holds  for  every  edge  e.  Proposition  3.1  thus  guarantees  that  /*  is  at  Nash  equilibrium  not 
only  for  the  instance  ( G,r,£  +  r),  but  also  for  the  instance  (G,  r,  £*).  Moreover,  in  the 
notation  of  Proposition  2.6, 

c(G,r,C)  —  c(G,  r,£  +  r).  (1) 

We  next  claim  that  f/2  is  at  Nash  equilibrium  for  (G,r/ 2,£*)  with 

C(G,  r /2,  £*)  =  c(G,  r,£).  (2) 

To  see  why,  we  note  that  since  £e(x)  =  aex  +  be  and  £*e{x )  =  2aex  +  be ,  edge  and  path  latencies 
with  respect  to  f/2  in  (G,r/2,£*)  and  with  respect  to  /  in  ( G,r,£ )  are  identical.  That  /  is 
a  Nash  flow  for  ( G,r,£ )  then  implies  that  f/2  is  at  Nash  equilibrium  for  (G,r/ 2,£*),  with 
c(G,r/2,£*)  =  c(G,r,£).3 

Combining  (1)  and  (2)  with  Proposition  2.7,  we  obtain 

c(G,  r,  £)  =  c(G,  rf  2,  £*)  <  c(G,  r,  £*)  =  c(G,  r,  £  +  r). 

The  theorem  now  follows  immediately  from  Proposition  2.6.  ■ 

Remark  3.3  Theorem  3.2  concerns  only  networks  with  linear  latency  functions,  but  it  can 
easily  be  extended  to  networks  in  which,  for  some  fixed  p  >  0,  every  edge  e  has  a  latency 
function  of  the  form  aexp  +  be  with  ae,  be  >  0. 

3The  essence  of  this  proof  first  appeared  in  [39]. 
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Remark  3.4  There  are  networks  with  nonlinear  latency  functions  in  which  marginal  cost 
taxes  can  decrease  the  cost  of  a  Nash  flow.  For  example,  it  is  possible  to  define  such  an 
example  using  the  network  in  Figure  1.  The  latency  functions  employed  in  this  example 
are  non-convex,  and  are  similar  to  step  functions.  The  details  are  somewhat  tedious  and  we 
omit  them.  We  do  not  view  this  example  as  a  positive  result  for  marginal  cost  taxes  (as  the 
example  is  contrived),  but  rather  as  justification  for  restricting  the  network  latency  functions 
in  Theorem  3.2.  We  leave  open  the  question  of  whether  the  negative  result  of  Theorem  3.2 
can  be  proved  under  significantly  weaker  assumptions  on  the  latency  functions. 

4  How  Powerful  Are  Arbitrary  Taxes? 

In  this  section  we  study  the  following  question:  how  much  can  the  cost  of  a  Nash  flow 
decrease  after  levying  taxes  on  the  edges?  As  we  will  see,  a  precise  answer  to  this  question 
follows  easily  from  previous  work  on  the  power  of  edge  removals  [36]. 

The  maximum-possible  benefit  from  taxes  will  depend  crucially  on  the  allowable  network 
latency  functions.  This  dependence  is  characteristic  of  much  of  the  work  on  selfish  routing 
(see  e.g.  [38]).  Indeed,  we  have  already  seen  a  glimpse  of  such  a  dependence  in  the  previous 
section,  where  marginal  cost  pricing  can  decrease  the  cost  of  a  Nash  flow,  but  not  in  networks 
with  linear  latency  functions. 

Our  first  two  upper  bounds  on  the  maximum-possible  reduction  in  cost  due  to  taxes 
are  consequences  of  previous  work  on  the  price  of  anarchy  [27,  34],  The  price  of  anarchy 
of  a  set  of  selfish  routing  instances  is  the  largest  ratio  between  the  total  latency  of  a  Nash 
flow  of  an  instance  and  that  of  a  minimum- latency  flow  for  the  instance.  The  price  of 
anarchy  is  a  function  of  the  set  of  allowable  latency  functions,  and  this  dependence  is  by 
now  well  understood.  For  example,  the  following  statements  are  known  (see  [10,  37]  for 
further  examples). 

Proposition  4.1  ([39])  The  price  of  anarchy  in  networks  with  linear  latency  functions 
is  4/3. 

Proposition  4.2  ([37])  The  price  of  anarchy  in  networks  with  latency  functions  that  are 
polynomials  with  degree  at  most  p  and  nonnegative  coefficients  is  asymptotically  Q(p/ log  p) 
as  p  — ■>  oo . 

Upper  bounds  on  the  price  of  anarchy  directly  translate  to  upper  bounds  on  the  largest 
decrease  in  cost  achievable  with  taxes:  at  best,  taxes  replace  the  Nash  flow  in  the  original 
network  with  the  minimum-latency  flow,  while  causing  no  additional  disutility  to  network 
users.  We  therefore  have  the  following  corollaries. 

Corollary  4.3  Let  (■ G ,  r,  £)  be  an  instance  with  linear  latency  functions  and  r  a  tax  on  edges. 
Let  f  and  fT  be  Nash  flows  for  (G,r,£)  and  (■ G,r,£  +  r),  respectively.  Then 
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Corollary  4.4  There  is  a  constant  ci  >  0  such  that  the  following  statement  holds  for  all 
p  >  2.  If  ( G ,  r,  £)  is  an  instance  with  polynomial  latency  functions  with  degree  at  most  p  and 
nonnegative  coefficients,  t  is  a  tax  on  edges,  and  f  and  fT  are  Nash  flows  for  ( G ,  r,  £)  and 
(G,r,£  +  t),  respectively,  then 


Hf)  <  ci 


V 

logp 


C(fT,r). 


In  Subsection  2.3  we  reviewed  the  Braess  graphs,  which  give  lower  bounds  on  how  much 
deleting  edges  can  decrease  the  total  latency  of  a  Nash  flow.  As  we  have  noted,  sufficiently 
large  taxes  can  simulate  edge  deletions,  so  these  lower  bounds  carry  over  to  the  present 
setting.  In  particular,  Corollary  2.10  implies  that  the  bound  of  Corollary  4.3  is  the  best 
possible,  and  Corollary  2.9  demonstrates  that  Corollary  4.4  is  optimal  up  to  a  constant 
factor. 

In  networks  with  arbitrary  latency  functions,  the  price  of  anarchy  is  unbounded,  even 
in  two-node,  two-link  networks  [39].  While  this  might  suggest  that  no  finite  upper  bound 
on  the  largest-possible  benefit  of  taxes  is  possible  in  such  networks,  a  bounded  price  of 
anarchy  is  only  a  sufficient  (and  not  necessary)  condition  for  such  a  bound.  Indeed,  with 
no  assumptions  whatsoever  on  the  network  latency  functions,  we  can  still  obtain  an  upper 
bound  that  is  a  function  of  the  network  size. 


Theorem  4.5  Let  ( G,r,£ )  be  an  instance  with  n  vertices  and  r  a  tax  on  edges.  Let  f  and 
fT  be  Nash  flows  for  (G,  r,  £)  and  (G,  r,  £  +  r),  respectively.  Then 

L(f)<  [jj  -WV)- 


The  proof  of  Theorem  4.5  is  a  straightforward  extension  of  an  argument  from  [36,  Theo¬ 
rem  4.1],  which  proves  the  weaker  statement  that  deleting  edges  from  a  network  can  reduce 
the  total  latency  of  a  Nash  flow  by  at  most  an  \n/2\  factor.  We  give  the  proof  in  Appendix  A 
for  completeness.  Proposition  2.8  implies  that  the  bound  of  Theorem  4.5  is  the  best  possible. 


5  Are  Taxes  More  Powerful  Than  Edge  Removals? 

In  Section  4,  we  saw  that  there  is  a  strong  connection  between  the  power  of  taxes  and  the 
power  of  edge  removals.  Specifically,  we  found  that  for  several  natural  classes  of  networks, 
the  maximum-possible  reduction  in  cost  achievable  by  levying  taxes  on  edges  is  the  same  as 
that  by  removing  edges  from  the  network.  However,  we  have  not  resolved  whether  or  not 
there  exist  any  networks  in  which  taxes  can  improve  upon  the  best  solution  obtainable  by 
removing  edges.  In  other  words,  is  the  power  of  taxes  no  greater  than  that  of  edge  removals 
even  on  an  instance-by-instance  basis?  We  study  this  question  in  this  section. 

In  Subsection  5.1  we  show  that  the  answer  is  “yes”  in  networks  with  linear  latency 
functions:  in  every  such  network,  taxes  cannot  decrease  the  cost  of  a  Nash  flow  more  than 
edge  removals  can.  By  contrast,  in  Subsection  5.2  we  show  that  in  general  networks,  taxes 
can  reduce  the  cost  of  a  Nash  flow  far  beyond  what  is  achievable  by  merely  deleting  edges 
from  the  network. 
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5.1  The  Power  of  Edge  Removals  in  Networks  with  Linear  Latency 
Functions 

5.1.1  Overview 

In  this  subsection  we  consider  only  networks  with  linear  latency  functions.  Our  main  result 
is  that  taxes  are  never  more  powerful  than  edge  removals  in  these  networks.  To  state  this 
formally,  we  will  say  that  a  set  r  of  taxes  for  the  instance  (G,r,£)  is  O/oo  if,  for  some  Nash 
flow  fT  for  ( G,r,£  +  r),  re  =  0  or  ff  =  0  for  each  edge  e.  We  note  that  O/oo  taxes  are  no 
more  powerful  than  edge  removals,  since  if  r  is  O/oo  then  c(G,r,£  +  r)  =  c(H,r,£),  where 
H  is  the  subgraph  of  G  comprising  the  edges  with  zero  tax. 

We  abuse  notation  and,  with  respect  to  an  instance  (■ G,r,£ ),  write  C(r)  to  denote 
C(/r,r),  where  fT  is  at  Nash  equilibrium  for  (G,r,£  +  r).  The  function  C[t )  is  well  defined 
by  Proposition  2.5(b).  A  tax  vector  r*  is  optimal  for  an  instance  (G,  r,  £)  if  C(r*)  <  G(r)  for 
all  nonnegative  tax  vectors  r.  Because  there  are  an  infinite  number  of  possible  tax  vectors,  it 
is  not  even  obvious  that  every  instance  admits  an  optimal  set  of  taxes.  The  following  result 
establishes  the  stronger  statement  that  every  instance  (with  linear  latency  functions)  admits 
an  optimal  tax  vector  that  is  O/oo. 

Theorem  5.1  An  instance  with  linear  latency  functions  admits  an  optimal  set  of  taxes  that 
is  O/oo. 

The  proof  of  Theorem  5.1  is  fairly  involved.  To  avoid  considering  an  arbitrary  network 
with  linear  latency  functions,  we  will  argue  by  contradiction  and  study  a  minimal  counterex¬ 
ample.  As  we  will  see,  minimal  counterexamples  possess  several  convenient  properties  that 
facilitate  the  proof. 

Precisely,  a  counterexample  is  an  instance  (G,  r,  £)  with  linear  latency  functions  that 
admits  no  optimal  O/oo  tax.  A  counterexample  is  minimal  if  no  other  counterexample  has 
fewer  edges.  A  tax  r  is  good  for  (G,r,£)  if  C(r)  <  C(t)  for  all  O/oo  taxes  f.  If  (G,r,£) 
admits  a  good  tax,  then  it  is  clearly  a  counterexample.  We  are  not  yet  claiming  that  a 
counterexample  (G,r,£)  admits  an  optimal  tax;  conceivably,  the  value  inf{C(r)  :  r  >  0} 
is  not  attained  by  any  tax.  However,  since  the  set  (C'(r)  :  r  is  O/oo}  is  finite,  every 
counterexample  admits  a  good  tax. 

To  achieve  the  desired  contradiction,  we  will  also  need  a  tax  that  is  in  some  sense  minimal. 
Formally,  we  will  call  a  tax  minimal  for  an  instance  if  it  is  an  optimal  tax  and  minimizes 
Y2ee e  re  among  all  optimal  taxes  r.  Once  we  establish  that  minimal  taxes  exist  for  minimal 
counterexamples,  the  proof  of  Theorem  5.1  will  be  relatively  short.  Our  key  lemma  is  thus 
the  following. 

Lemma  5.2  A  minimal  counterexample  admits  a  minimal  tax. 

5.1.2  Properties  of  Minimal  Counterexamples 

The  proof  of  Lemma  5.2  makes  use  of  several  properties  of  minimal  counterexamples.  We 
establish  these  next. 
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First,  minimal  counterexamples  are  directed  acyclic  networks,  with  every  good  tax  in¬ 
ducing  a  flow  at  Nash  equilibrium  that  routes  flow  on  every  edge. 

Lemma  5.3  Let  ( G,r,£ )  be  a  minimal  counterexample,  r  a  good  tax,  and  fT  a  flow  at  Nash 
equilibrium  for  ( G ,  r,  £  +  r) .  Then  ff  >  0  for  all  edges  e  of  G. 

Proof:  If  fl  =  0  for  some  edge  e  of  G,  then  deleting  e  from  G  yields  a  counterexample  with 
fewer  edges.  ■ 

Corollary  5.4  If  (G,  r,  £)  is  a  minimal  counterexample,  then  G  is  directed  acyclic. 

Proof:  The  corollary  follows  immediately  from  Proposition  2.4  and  Lemma  5.3.  ■ 

Our  next  lemma  states  that  Nash  flows  in  minimal  counterexamples  are  unique,  up  to 
the  path  decomposition  of  the  induced  flow  on  edges  (cf.,  Corollary  2.3). 

Lemma  5.5  If  (G,r,£)  is  a  minimal  counterexample,  t  is  a  tax,  and  fT  is  a  Nash  flow  for 
(G,  r,  £  +  t),  then  f  is  a  Nash  flow  for  (G,  r,  £  +  r)  if  and  only  if  fe  =  ff  for  all  edges  e. 

Proof:  The  “if”  direction  is  Corollary  2.3.  For  the  “only  if”  direction,  suppose  for  contra¬ 
diction  that  /  is  a  Nash  flow  for  (G,  r,  £  +  r)  with  fe  ff  for  some  edge  e.  Put  ze  =  fe  —  ff 
for  each  edge  e.  Since  /  and  fT  are  flows  at  the  same  traffic  rate,  z  is  a  (signed)  circulation: 
for  each  vertex  v,  with  edges  <5+(u)  having  tail  v  and  edges  S~(v)  having  head  v, 

Ze=  J2  Ze- 

e£5+(^)  e£S~(v) 

By  Proposition  2.5(a),  ze  is  non-zero  only  when  e  has  a  constant  latency  function. 

Since  ( G,r,£ )  is  a  counterexample,  it  admits  a  good  tax  f.  (Recall  r  need  not  be  good.) 
Let  fT  be  a  Nash  flow  for  (G,  r,  £+t).  Since  ff  >  0  for  all  edges  e  by  Lemma  5.3,  {ff +\ze}e£E 
is  a  nonnegative  vector  for  A  sufficiently  near  zero  (positive  or  negative).  In  this  case,  it 
corresponds  (after  a  path  decomposition)  to  a  flow  fx  feasible  for  ( G,  r,£  +  t).  Moreover, 
since  ze  is  non- zero  only  when  e  possesses  a  constant  latency  function,  Proposition  2.2  implies 
that  fx  is  at  Nash  equilibrium  for  ( G,r,£  +  f).  Since  we  assumed  that  some  ze  is  non-zero, 
some  choice  of  A  yields  a  Nash  flow  fx  for  ( G,  r,£  +  f)  with  ff  =  0  for  some  edge  e.  This 
contradicts  Lemma  5.3.  ■ 

Finally,  we  strengthen  Proposition  2.6(a)  for  minimal  counterexamples. 

Lemma  5.6  Let  ( G,r,£ )  be  a  minimal  counterexample,  r  a  good  tax,  and  fT  a  Nash  flow 
for  (G,  r,  £  +  r).  Then  there  is  a  constant  c  such  that 

tp(fT)  +  TP  =  C 

for  every  s-t  path  P  of  G. 

Proof:  By  Lemma  5.3,  we  can  choose  a  path  decomposition  /  of  {ff}e&E  with  fP  >  0 
for  all  paths  P  E  V.  Corollary  2.3  implies  that  /  is  at  Nash  equilibrium  for  (G,  r,  £  +  r); 
Proposition  2.6(a)  implies  that  all  s-t  paths  have  a  common  latency  plus  tax.  ■ 
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5.1.3  Proof  of  Lemma  5.2 

Recall  that  our  key  lemma,  Lemma  5.2,  asserts  that  a  minimal  counterexample  admits 
a  minimal  tax.  Onr  proof  of  this  result  requires  two  technical  lemmas,  which  will  allow 
us  to  prove  the  existence  of  optimal  and  minimal  taxes  for  minimal  counterexamples  via 
compactness  arguments.  The  first  lemma  states  that  bounded  taxes  suffice  to  minimize  the 
cost  of  a  Nash  flow  in  a  minimal  counterexample. 

Lemma  5.7  Let  (G,r,£)  be  a  minimal  counterexample  and  r  a  good  tax.  Let  G  have  n 
vertices  and  define  £max  =  ma xeG£Te(r).  There  is  a  tax  t  with  maxefe  <  n£max  and 

C(f)  <  C(t). 

Proof:  Let  r  be  a  good  tax  for  (G,r,£),  with  fT  at  Nash  equilibrium  for  (G,r,£  +  r).  We 
next  show  how  to  decrease  taxes  while  leaving  fT  at  Nash  equilibrium. 

By  Corollary  5.4,  G  is  directed  acyclic  and  we  can  therefore  order  the  vertices  of  G  so 
that  all  edges  of  G  travel  forward.  Since  ff  >  0  on  all  edges  of  G  (Lemma  5.3),  s  is  the 
first  vertex  in  the  ordering  and  t  is  the  last.  Beginning  with  the  penultimate  vertex  and 
proceeding  backward  in  the  ordering,  we  perform  the  following  operation  for  each  vertex 
v  7^  s:  let  tv  >  0  denote  the  minimum  tax  on  an  edge  with  tail  v,  subtract  rv  from  the  tax  of 
every  edge  with  tail  v  and  add  tv  to  the  tax  of  every  edge  with  head  v.  This  operation  leaves 
the  total  tax  of  all  s-t  paths  and  the  cost  of  all  feasible  flows  unchanged.  In  particular,  the 
flow  fT  remains  at  Nash  equilibrium.  When  the  source  s  is  reached,  subtract  ts  from  the 
tax  on  all  edges  with  tail  s;  fT  remains  at  Nash  equilibrium  and  its  cost  can  only  decrease. 
Call  the  new  set  of  taxes  f.  We  have  already  argued  that  C'(t)  <  C(r);  it  remains  to  show 
that  maxefe  <  n£max. 

We  first  observe  that  the  tax-reducing  operations  iteratively  enforce  the  following  prop¬ 
erty:  every  vertex  other  than  t  has  an  outgoing  edge  with  zero  f-tax.  This  property  implies 
that  some  s-t  path,  say  Po ,  has  zero  r-tax.  Since  C(f)  <  C(r),  t  is  a  good  tax.  Since  fT  is 
at  Nash  equilibrium  for  ( G,r,£  +  f),  Lemma  5.6  implies  that 

£p(fT)  +  fP  =  £p0(fT) 

for  every  path  P  EV.  Since  £p0(fT)  <  n£max  and  every  edge  of  G  lies  on  some  s-t  path  (by 
Lemma  5.3),  no  edge  tax  in  f  can  exceed  n£max.  The  proof  is  complete.  ■ 

The  second  technical  lemma  asserts  continuity  of  the  map  r  h>  C(r). 

Lemma  5.8  Let  (G,  r,£)  be  a  minimal  counterexample.  Then  the  corresponding  map  r  i— ► 
C[t )  is  continuous. 

Proof:  Lemma  5.5  ensures  that  the  map  r  i— >  {ff}eeE  is  well  defined.  Lemma  5.5  and  a 
result  of  Dafermos  and  Nagurney  [11,  Theorem  3.1]  imply  that  this  map  is  continuous.  The 
lemma  then  follows  easily.  ■ 

We  are  finally  prepared  to  prove  Lemma  5.2. 
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Proof  of  Lemma  5.2:  We  first  show  that  a  minimal  counterexample  (G,  r,  £)  admits  an  optimal 
tax — that  infr>0  G(r)  is  attained  by  some  tax.  Let  B  denote  the  taxes  with  all  components 
bounded  by  n£max,  where  n  is  the  number  of  vertices  of  G  and  £max  =  ma xe£E£e(r).  Since 
(■ G,  r,£ )  is  a  counterexample,  the  value  inf  t>0G(t)  is  approached  by  good  taxes.  Since 
(■ G,  r,£ )  is  minimal,  Lemma  5.7  implies  that  infr>o  G(t)  is  approached  by  good  taxes  in  B : 

inf  C(t)  =  inf  G(t). 

t£B  V  7  r>0  7 

Since  B  is  a  compact  subset  of  1ZE  and  C  is  continuous  by  Lemma  5.8,  these  inhma  are 
attained  by  some  tax  (in  B). 

Let  O  denote  the  (non-empty)  set  of  optimal  taxes  for  ( G,r,£ ).  We  will  show  that 
infTeo  Y2e  Te  attained  by  some  optimal  tax.  By  Lemma  5.7,  there  is  an  optimal  tax  r  G  O 
with  Te  —  mn^max,  where  m  is  the  number  of  edges  of  G.  Since  O  is  the  inverse  image 

of  a  closed  set  under  a  continuous  map  (write  O  =  G_1(G(r ))  for  some  r  G  O),  it  is  a 
closed  subset  of  7ZE.  Restricting  O  to  taxes  with  sum  of  all  components  at  most  mn£max, 
we  obtain  a  nonempty  compact  subset  S  C  O  of  optimal  taxes.  Since  r  i— >•  re  is  a 

continuous  function,  it  attains  a  minimum  on  S;  this  is  also  its  minimum  on  O,  and  the 
proof  is  complete.  ■ 


5.1.4  Proof  of  Theorem  5.1 

With  Lemma  5.2  in  hand,  we  can  now  prove  Theorem  5.1  by  a  perturbation  argument. 

Proof  of  Theorem  5.1:  To  derive  a  contradiction,  let  ( G,r,£ )  be  a  minimal  counterexample, 
r  a  minimal  (and  hence  good)  tax,  and  fT  a  Nash  flow  for  ( G,r,£  +  r).  Since  (G,r,£)  is  a 
counterexample,  ^ere  >  0-  By  Lemma  5.6,  there  is  a  constant  cT  such  that  £p(fT)  +  Tp  =  cT 
for  all  s-t  paths  P  of  G. 

Write  £e(x)  =  aex  +  be  for  each  edge  e  of  G.  The  equations 


^[ae/e  +  be  +  Te)  =  C 

edP 


for  all  P  G  V,  together  with  the  standard  flow  conservation  constraints  for  /,  form  a  system 
of  equations  linear  in  the  m  +  1  variables  {fe}eeE  and  c  (where  m  is  the  number  of  edges  of 
G).  By  Lemma  5.5,  ({/J},  cr)  is  the  unique  solution  to  this  system,  with  flow  nonnegativity 
constraints  automatically  satisfied  (indeed,  strictly  by  Lemma  5.3).  Choosing  m  +  1  linearly 
independent  constraints  with  at  least  one  constraint  corresponding  to  a  path  with  nonzero 
tax,  there  is  a  square  linear  system 


A 


f 

c 


d 


for  which  ({ff},cT)  is  the  unique  solution,  namely  A~ld. 

Let  Te  >  0  be  a  positive  edge  tax  appearing  in  this  linear  system  (as  part  of  the  right-hand 
side  d).  Consider  perturbing  this  tax  by  subtracting  a  small  number  e  >  0.  This  translates 
to  a  perturbation  of  adding  e  to  the  right-hand  side  of  all  constraints  corresponding  to  paths 
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that  include  the  edge  e,  resulting  in  the  new  right-hand  side  d.  Since  fj  >  0  for  every  edge  e, 
the  new  Nash  flow  is  given  by 

({/e},c)  =  A~ld 

for  sufficiently  small  perturbations.  Since  r  is  an  optimal  tax  minimizing  ^ere,  subtracting 
e  >  0  from  produces  a  non-optimal  tax.  By  Proposition  2.6,  it  follows  that  cr  <  c.  By 
linearity,  however,  the  opposite  perturbation  of  adding  e  to  rg  has  the  opposite  effect,  pro¬ 
ducing  a  tax  f  that  induces  a  solution  ({/e},  c)  with  c  <  cT.  This  contradicts  the  optimality 
of  r,  and  the  proof  is  complete.  ■ 

5.2  The  Power  of  Taxes  in  General  Networks 

The  previous  subsection  showed  that  in  every  network  with  linear  latency  functions,  taxes 
cannot  improve  over  the  best  solution  attainable  by  removing  edges  from  the  network.  We 
now  demonstrate  that  this  result  does  not  extend  to  networks  with  nonlinear  latency  func¬ 
tions.  In  fact,  for  each  value  of  n  >  2,  there  is  an  n-node  network  in  which  arbitrary 
nonnegative  taxes  can  improve  upon  O/oo  taxes  by  an  |_n/2j  factor.  With  this  result,  we  will 
have  a  good  understanding  of  the  relationship  between  taxes  and  edge  removals  in  general 
networks.  Briefly,  removing  edges  can  improve  the  cost  of  a  Nash  flow  by  an  |_n/2j  factor 
(Proposition  2.8);  taxes  can  improve  the  cost  of  a  Nash  flow  by  an  |_n/2j  factor  beyond  what 
is  achievable  by  removing  edges  (Theorem  5.9  below);  but  taxes  (or  edge  removals)  cannot 
improve  the  cost  of  a  Nash  flow  by  more  than  an  \nj 2J  factor  (Theorem  4.5). 

Theorem  5.9  For  each  integer  n  >  2,  there  is  an  instance  (G,  r,£)  with  c(H,  r,£)  >  [n/2j 
for  all  subgraphs  H  of  G  but  c(G ,  r,  i  +  r)  =  1  for  some  tax  r  >  0. 

Proof:  The  construction  is  similar  to  that  in  an  inapproximability  result  for  network  de¬ 
sign  [36,  Theorem  4.3].  We  can  assume  that  n  is  even  and  at  least  4.  (For  n  odd,  add  an 
isolated  vertex  or  subdivide  an  edge.)  We  will  work  with  the  Braess  graph  Bk  for  which 
2k  +  2  =  n.  (See  Subsection  2.3  for  notation  and  terminology.) 

We  define  latency  functions  ik  on  the  graph  Bk  as  follows. 

(A)  Type  A  edges  receive  the  latency  function  £k(x)  =  0. 

(B)  Type  B  edges  are  given  a  (continuous,  nondecreasing)  latency  function  £k  satisfying 
lk{x)  —  1  for  x  <  l/(k  +  1)  and  lk{x)  =  n/2  for  x  >  l/(k  +  1)  +  e,  where  e  >  0  is  a 
sufficiently  small  constant. 

(C)  For  each  i  e  {1, the  type  C  edges  ( w^t )  and  (s,Vk-i+ 1)  receive  a  latency 
function  ik  satisfying  £k(x)  —  0  for  x  <  1  +  l/(k  +  1),  £k{l  +  1/k)  —  i,  and  tk{x)  =  n/2 
for  x  >  1  +  1/k  +  e. 

If  a  type  B  edge  carries  at  least  l/(k  +  1)  +  e  units  of  flow  or  a  type  C  edge  carries  at  least 
1  +  1/k  +  e  units  of  flow,  we  will  say  that  the  edge  is  oversaturated.  A  simple  but  important 
observation  is  that  if  a  Nash  flow  oversaturates  an  edge  in  (ih,  r,  lk)  for  some  subgraph  H  of 
Bk ,  then  c(H,  r,£k)  >  n/2. 
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(a)  Latencies  plus  taxes  in  a  Nash  flow 
for  (Bk,k  +  l,£k  +  t) 


(b)  Latencies  in  a  Nash  flow  for 
(Bk,k  +  1,P) 


Figure  4:  Proof  of  Theorem  5.9  when  k  —  3.  Solid  edges  carry  traffic  in  the  flow  at  Nash 
equilibrium,  dashed  edges  do  not.  Edges  are  labeled  with  their  cost  (sum  of  latency  and 
tax),  where  latencies  are  with  respect  to  flows  at  Nash  equilibrium. 


First,  let  r  be  the  tax  vector  equal  to  1  on  type  A  edges  and  0  elsewhere.  The  following 
flow  is  then  at  Nash  equilibrium  for  ( Bk,k  +  1,P  +  r):  route  1  unit  of  flow  on  each  of 
Pi,  P-2, . . . ,  Pk  and  l/(/c  T  1)  units  of  flow  on  each  of  Q i,  Q2, . . . ,  Qk+i ■  Type  A  edges  then 
each  have  zero  latency  and  one  unit  of  tax,  type  B  edges  each  have  zero  tax  and  one  unit 
of  latency,  and  type  C  edges  have  zero  latency  and  tax  (see  Figure  4(a)).  This  Nash  flow 
proves  that  c(Bk ,  k  +  1,  £k  +  t)  =  1. 

To  finish  the  proof,  we  need  to  show  that  c(ip  k  +  1,  £k)  >  n/2  for  every  subgraph  P[  of 
Bk ;  this  requires  a  bit  of  case  analysis.  If  H  is  all  of  Bk,  then  c(H ,  k  +  1,P)  =  n/2  because 
routing  1  +  1  jk  units  of  traffic  on  each  of  Pi,  P2,  ■ .  ■ ,  Pk  provides  a  flow  at  Nash  equilibrium 
(see  Figure  4(b)).  Similarly,  c(H,  k  +  1,P)  =  n/2  if  H  omits  only  type  B  edges. 

Next,  suppose  H  omits  some  type  C  edge  and  e  >  0  is  sufficiently  small.  If  edge  (s,+) 
is  not  in  H,  then  every  flow  feasible  for  (ip  k  +  1,  ik)  oversaturates  some  edge  incident  to  s. 
Thus  c(if,  k  +  1,P)  >  n/2  if  H  omits  a  type  C  edge  incident  to  s.  A  symmetric  argument 
applies  to  subgraphs  H  that  omit  a  type  C  edge  incident  to  t. 

Finally,  suppose  H  omits  some  type  A  edge,  say  ( t+uy ).  The  vertex  u*  then  has  at  most 
one  outgoing  edge  in  H ,  which  must  be  a  type  B  edge.  If  this  edge  is  not  oversaturated, 
then  the  edge  (s,  up  carries  at  most  A--  +  e  units  of  flow;  as  in  the  previous  paragraph,  this 
implies  that  some  edge  incident  to  s  is  oversaturated.  In  either  case,  c(H ,  k  +  1,P)  >  n/2 
and  the  proof  is  complete.  ■ 
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6  The  Complexity  of  Computing  Optimal  Taxes 

In  this  section,  we  study  the  optimization  problem  of  minimizing  the  cost  of  a  Nash  flow  by 
taxing  the  network  edges,  and  extend  an  existing  hardness  result  for  network  design  [36]  to 
this  problem. 

By  an  a- approximation  algorithm,  for  a  minimization  problem,  we  mean  an  algorithm 
that  runs  in  polynomial  time  and  returns  a  solution  no  more  than  a  times  as  costly  as  an 
optimal  solution.  We  will  call  the  value  a  the  approximation  ratio  or  performance  guarantee 
of  the  algorithm. 

The  maximum-possible  benefit  achievable  with  taxes,  as  determined  in  Section  4,  has  im¬ 
mediate  consequences  for  the  performance  guarantee  of  the  trivial  algorithm — the  algorithm 
that  assigns  all  edges  zero  tax.  In  particular,  Corollary  4.3  implies  the  following. 

Corollary  6.1  The  trivial  algorithm  is  a  |- approximation  algorithm  for  the  problem  of  tax¬ 
ing  edges  to  minimize  the  cost  of  a  Nash  flow  in  networks  with  linear  latency  functions. 

Roughgarden  [36]  gave  several  inapproximability  results  for  the  problem  of  removing 
edges  from  a  network  to  minimize  the  total  latency  of  a  Nash  flow.  We  next  extend  one  of 
them  to  the  problem  of  computing  optimal  taxes  in  networks  with  linear  latency  functions. 

Theorem  6.2  For  every  e  >  0,  there  is  no  (|  —  e) -approximation  algorithm  for  the  problem 
of  taxing  edges  to  minimize  the  cost  of  a  Nash  flow  in  networks  with  linear  latency  functions 
(unless  P  =  NP). 

Corollary  6.1  and  Theorem  6.2  imply  that,  in  networks  with  linear  latency  functions,  no 
polynomial-time  algorithm  has  approximation  ratio  better  than  that  of  the  trivial  algorithm 
(assuming  P  NP). 

The  reduction  used  to  prove  Theorem  6.2  is  identical  to  one  in  [36],  and  is  from  a  disjoint 
paths  problem.  However,  the  proof  of  Theorem  6.2  is  slightly  more  involved  than  that  of  its 
analogue  in  [36],  due  to  the  extra  power  of  taxes  beyond  that  of  edge  removals.  In  particular, 
proving  that  “no”  instances  of  a  disjoint  paths  problem  give  instances  in  which  all  possible 
taxes  induce  a  costly  Nash  flow  is  harder  than  showing  that  all  possible  subgraphs  of  these 
instances  have  costly  Nash  flows. 

For  completeness,  we  prove  Theorem  6.2  in  Appendix  B. 

Remark  6.3  For  the  problem  of  removing  edges  from  a  network  to  minimize  the  total 
latency  of  a  Nash  flow,  Roughgarden  [36]  also  gave  inapproximability  results  for  networks 
with  different  types  of  nonlinear  latency  functions.  These  showed  that  the  trivial  algorithm 
is  an  optimal  approximation  algorithm  for  the  problem  in  these  classes  of  networks.  We 
believe  that  most  (if  not  all)  of  these  hardness  results  should  carry  over  to  the  problem  of 
computing  optimal  taxes  in  networks  with  nonlinear  latency  functions,  but  we  have  been 
unable  to  verify  the  details. 
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7  Directions  for  Further  Research 


We  have  undertaken  the  first  study  of  using  edge  taxes  to  minimize  the  cost — latency  plus 
taxes  paid — of  a  Nash  flow.  While  we  have  answered  several  basic  questions,  some  obvious 
gaps  in  our  results  remain.  We  next  list  three  of  the  more  glaring  ones. 

(Ql)  Suppose  we  consider  restricted  latency  functions  that  need  not  be  linear,  such  as  convex 
functions  or  degree-bounded  polynomials  with  nonnegative  coefficients.  Can  marginal 
cost  taxes  improve  the  cost  of  a  Nash  flow?  Can  levying  taxes  decrease  the  cost  of  a 
Nash  flow  beyond  what  is  achievable  with  edge  removals? 

(Q2)  Is  the  trivial  algorithm  an  optimal  approximation  algorithm  for  the  problem  of  com¬ 
puting  optimal  taxes  in  networks  with  nonlinear  latency  functions? 

(Q3)  Which  results  of  this  paper  remain  true  in  multicommodity  flow  networks,  with  mul¬ 
tiple  sources  and  destinations?  (See  Lin  et  al.  [29]  for  some  very  recent  work  on  this 
question.) 

A  broader  research  issue  is  to  study  other  approaches  to  simultaneously  minimizing  both 
the  total  latency  and  the  taxes  paid  by  traffic.  For  example,  Hearn  and  Ramana  [22]  show 
how  to  efficiently  compute,  among  all  tax  vectors  that  induce  a  minimum- latency  flow  as  a 
flow  at  Nash  equilibrium,  the  tax  vector  minimizing  the  amount  of  tax  paid  by  the  traffic. 
(Of  course,  even  the  best  such  tax  vector  might  require  traffic  to  pay  exorbitant  taxes.)  A 
different  objective  is  to  minimize  the  total  latency  of  traffic,  subject  to  a  fixed  budget  on  the 
amount  of  taxes  that  can  be  paid.  Are  there  non-trivial  algorithmic  results  for  this  problem? 
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A  Proof  of  Theorem  4.5 

Proof  of  Theorem  4-5:  Let  ( G,r,I )  be  an  instance  in  which  G  has  n  vertices,  and  let  r  be 

a  tax  on  the  edges.  Let  /  be  a  directed  acyclic  Nash  flow  for  (G,  r,  I)  (see  Proposition  2.4), 

and  f*  a  Nash  flow  for  ( G,r,i  +  r).  Let  d  and  d*  be  the  corresponding  distance  labels  of 
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Proposition  2.2.  In  the  notation  of  Proposition  2.6,  we  need  to  prove  that  D  =  d{t)  = 
c(G,r,  £)  is  at  most  [n/2j  times  D*  =  d*(t )  =  c(G,r,l  +  r). 

An  ordering  of  the  vertices  of  G  is  good  if  it  satisfies  the  following  two  properties. 

(PI)  All  /-flow  travels  forward  in  the  ordering. 

(P2)  The  d-values  of  vertices  are  nondecreasing  in  the  ordering. 

There  is  at  least  one  good  ordering.  To  see  why,  first  topologically  sort  the  vertices  of  G 
according  to  the  (directed  acyclic)  flow  /  to  ensure  property  (PI).  An  ordered  pair  (u,v) 
of  vertices  is  bad  if  d(v)  <  d(u )  in  spite  of  v  following  u  in  the  ordering.  Property  (P2)  is 
equivalent  to  the  absence  of  bad  vertex  pairs. 

If  there  is  a  bad  vertex  pair,  there  is  one  such  pair  ( v,w )  with  v  and  w  adjacent  in  the 
ordering.  We  would  like  to  transpose  v  and  w.  By  Proposition  2.2  and  the  nonnegativity  of 
latency  functions,  d-values  cannot  decrease  across  an  edge  e  with  fe  >  0.  Hence,  there  is  no 
flow-carrying  edge  from  u  to  w.  Transposing  v  and  w  therefore  does  not  violate  property  (PI) 
and  strictly  decreases  the  number  of  bad  vertex  pairs.  Finitely  many  such  transpositions 
yields  a  good  ordering. 

Place  a  good  ordering  on  the  vertices  of  G  and  label  them  UoAh,  •  •  • ,  W-i  accordingly. 
We  can  assume  that  vq  =  s.  Call  an  edge  e  of  G  light  if  fe  <  f*  with  f*  >  0,  and  heavy 
otherwise.  We  can  finish  the  proof  by  establishing  two  claims  (see  also  Figure  5). 

(1)  If  Vj  precedes  t  in  the  good  ordering,  then  there  is  a  path  of  light  edges  beginning  in 
{u0,  Vi, . . .  ,  Vj}  and  terminating  in  {t,  vj+2,  vj+3,  ■  ■  ■ ,  un_i}. 

(2)  If  there  is  a  path  of  light  edges  from  u  to  v,  then  d(v)  <  d(u)  +  D*. 

Since  d(s)  =  0  and  d- values  are  nondecreasing  in  the  good  ordering,  applying  these  two 
claims  inductively  to  the  sets  {u0, . . . ,  v2i }  gives  d(v 2i)  <i-D*  for  v2i  equal  to  or  preceding  t. 
If  t  =  v2 i  for  an  integer  i,  the  theorem  follows  immediately.  If  t  =  U2?.+i,  then  d{v2j)  <  i  ■  D* 
and  the  theorem  follows  from  one  further  application  of  the  two  claims  (to  {uq,  •  •  •  ,^2i})- 


1(f)  <L*  1(f)  < 


d=0  d  <  L  d  <  2L 


Figure  5:  Proof  of  Theorem  4.5.  If  /  is  the  flow  sending  one  unit  of  flow  on  the  four-hop 
path  and  /*  is  the  flow  sending  half  a  unit  of  flow  on  each  of  the  other  two  paths,  then  the 
dashed  edges  are  light. 


To  prove  the  first  claim,  let  Vj  precede  t  in  the  good  ordering.  By  property  (PI)  of  good 
orderings,  no  /-flow  enters  the  s-t  cut  S  =  {uq,-  •  • ,  Vj}.  Since  the  net  /-flow  and  /*-flow 
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escaping  any  s-t  cut  is  precisely  r  (see  e.g.  [43,  Lemma  8.1]),  at  least  one  light  edge  escapes 
S.  If  some  such  edge  has  its  head  in  {t,Vj+2,  ■  ■  ■ ,  un_i},  we  are  done.  If  not,  all  such  light 
edges  terminate  at  a  vertex  v]+\  that  precedes  t  in  the  ordering.  By  the  above  argument, 
some  light  edge  e  escapes  (no, . . . ,  v]+\  }.  Since  all  light  edges  emanating  from  S  end  at  Vj+ 1, 
e  begins  at  Vj+ 1.  Thus,  its  concatenation  with  any  light  edge  escaping  S  provides  the  desired 
path  of  light  edges. 

For  the  second  claim,  let  P  be  a  path  of  light  edges  from  u  to  v.  Since  taxes  are 
nonnegative  and  latency  functions  are  nondecreasing,  £e(fe)  <  £e(ff)  +  Te  for  every  edge  e 
in  P.  Since  f*  >  0  for  all  edges  e  of  P,  Proposition  2.2  implies  that 

d(v)  -  d(u )  <  ^2  We)  <  [We)  +  Te\  =  d*  (v)  ~  d*  (u)  ■ 

e£P  e€ P 


Similarly, 


and  hence  d(v)  —  d(u)  <  d*(v) 
hence  the  theorem.  ■ 


0  <  d*(u)  <  d*(v)  <  D* 

—  d*(u )  <  D*.  This  completes  the  proof  of  the  claim  and 


B  Proof  of  Theorem  6.2 

Proof  of  Theorem  6.2:  As  in  [36,  Theorem  3.3],  our  reduction  will  proceed  from  the  problem 
2  Directed  Disjoint  Paths  (2DDP):  given  a  directed  graph  G  =  ( V,E )  and  distinct 
vertices  si,  s2,  h,  t2  €  V,  are  there  Sj-tj  paths  Pt  for  i  —  1,  2  such  that  Pi  and  P2  are  vertex- 
disjoint?  Fortune,  Hopcroft,  and  Wyllic  [19]  showed  that  this  problem  is  NP-complete.  We 
will  show  how  a  (|  —  e)-approximation  algorithm  for  minimizing  the  cost  of  a  Nash  flow  with 
taxes  in  a  network  with  linear  latency  functions  can  be  used  to  distinguish  “yes”  and  “no” 
instances  of  2DDP  in  polynomial  time. 

Let  T  —  (G  —  (V,E),si,s2,ti,t2)  be  an  instance  of  2DDP.  Augment  the  vertex  set  V 
by  a  source  s,  a  sink  t,  and  directed  edges  (s,  si),  (s,  s2),  (ti,t),  and  (t2,t)  (see  Figure  6). 

Denote  the  new  network  by  G'  =  (V',E').  We  give  the  edges  of  E’  the  following  (linear) 

latency  functions:  all  edges  of  E  receive  the  latency  function  l(x)  =  0,  edges  (s,  s2)  and 
(ti,t)  are  given  the  latency  function  £(x)  =  x,  and  edges  (s,  si)  and  (t2,t)  are  endowed  with 
the  latency  function  £(x)  =  1. 

Following  the  notation  of  Subsection  5.1,  we  will  write  C{r)  for  the  cost  of  a  Nash  flow 
in  (G',  1  ,£  +  t).  To  complete  the  proof,  we  need  only  show  the  following  two  statements: 

(i)  if  X  is  a  “yes”  instance  of  2DDP,  then  there  is  a  set  r  of  taxes  with  C{r)  =  3/2; 

(ii)  if  X  is  a  “no”  instance,  then  C(r)  >2  for  all  nonnegative  taxes  r. 

To  prove  (i),  let  Pi  and  P2  be  vertex-disjoint  s\-t\  and  s2-t2  paths  in  G,  respectively. 
Assign  zero  taxes  to  edges  in  E'  \  E,  Pi,  and  P2.  Set  taxes  re  on  all  other  edges  e  to  be 
sufficiently  large.  Then,  routing  half  a  unit  of  flow  on  each  of  Pi  and  P2  yields  a  flow  at 
Nash  equilibrium  proving  that  C(t)  =  3/2. 
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Figure  6:  Proof  of  Theorem  6.2.  In  a  “no”  instance  of  2DDP,  si-fi  and  s2-t2  paths  must 
share  a  vertex. 


For  (ii),  consider  taxes  r  and  a  Nash  flow  fT  for  (G',  1  ,£  +  r).  Let  d  denote  the  distance 
labels  of  Proposition  2.2.  Since  there  is  one  unit  of  traffic,  (ii)  is  tantamount  to  showing  that 
d(t)  >  2. 

First,  if  fT  routes  all  flow  through  a  single  pair,  then  the  latency  of  all  flow  paths  is 
2  and  hence  d(t)  >  2.  We  can  thus  assume  that  fT  routes  flow  through  at  least  two  distinct 
Si-tj  pairs.  Similarly,  we  can  assume  that  no  flow  path  contains  both  .s'i  and  t2. 

Now  suppose  fT  routes  flow  on  a  path  P\  containing  si  and  t\  and  also  on  a  path  P2 
containing  s2  and  1 2.  Since  X  is  a  “no”  instance  of  2DDP,  P\  and  P2  share  an  internal  vertex 
v  (see  Figure  6).  Since  P\  is  a  flow  path  of  the  Nash  flow  fT  that  contains  v  and  the  edge 
(s, Si),  Proposition  2.2  implies  that  d(v)  >  1.  Similarly,  since  P2  is  a  flow  path  containing  v 
and  the  edge  (t2,t),  Proposition  2.2  implies  that  d(t)  >  d(v)  +  1  >  2. 

The  final  case  arises  when  all  flow  paths  contain  a  single  ,st  or  a  single  tj  (but  not 
both).  If  all  flow  paths  contain  a  single  s*,  then  all  flow  uses  edge  ( s,Si ),  so  d(si )  >  1.  As 
in  the  previous  paragraph,  since  there  is  a  flow  path  of  fT  containing  Si  and  edge  (t2,t), 
Proposition  2.2  implies  that  d(t)  >  d(si )  +  1  >  2.  A  similar  argument  proves  that  d(t)  >  2 
if  all  flow  paths  of  fT  contain  a  single  ■ 
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